Skip to content

Use Unicode Escape Sequence to replace encoded characters#2814

Merged
ggerganov merged 1 commit intoggml-org:masterfrom
drasticactions:unicode-escape-sequence
Aug 26, 2023
Merged

Use Unicode Escape Sequence to replace encoded characters#2814
ggerganov merged 1 commit intoggml-org:masterfrom
drasticactions:unicode-escape-sequence

Conversation

@drasticactions
Copy link
Contributor

@drasticactions drasticactions commented Aug 26, 2023

Using special characters within source files can break compiling on some computers with different regions and language settings. I have a ja-JP Windows 11 setup, and trying to compile the current master branch fails on find_bpe_rank due to the special characters recently introduced. Note that using a compiled build is fine; only compiling itself fails.

Using Unicode escape sequences should allow the code to be compiled on all setups without changing your computer's settings or switching regions. Trying out my changes and it seems like everything processes as it should, but hopefully others with more C++ experience know if I screwed something else up here.

e. Searching through the other repos, similar techniques have been done before, so I'm feeling more confident now in this.

The use of special characters within source files can break compiling on some computers with different region and language settings. Using Unicode escape sequences should allow for the code to be compiled on all setups without needing to change your computers settings or switch regions.
@ggerganov ggerganov merged commit c7d92e6 into ggml-org:master Aug 26, 2023
@drasticactions drasticactions deleted the unicode-escape-sequence branch August 27, 2023 00:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants